Deeply-Supervised Nets
نویسندگان
چکیده
We propose deeply-supervised nets (DSN), a method that simultaneously minimizes classification error and improves the directness and transparency of the hidden layer learning process. We focus our attention on three aspects of traditional convolutional-neuralnetwork-type (CNN-type) architectures: (1) transparency in the effect intermediate layers have on overall classification; (2) discriminativeness and robustness of learned features, especially in early layers; (3) training effectiveness in the face of “vanishing” gradients. To combat these issues, we introduce “companion” objective functions at each hidden layer, in addition to the overall objective function at the output layer (an integrated strategy distinct from layer-wise pretraining). We also analyze our algorithm using techniques extended from stochastic gradient methods. The advantages provided by our method are evident in our experimental results, showing state-of-the-art performance on MNIST, CIFAR-10, CIFAR-100, and SVHN.
منابع مشابه
Coupling Supervised and Unsupervised Techniques in Training Feed-Forward Nets
A popular approach to training feed-forward nets is to treat the problem of adaptation as a function approximation and to use curve fitting techniques. We discuss here the problems which the use of pure curve fitting techniques entail for the generalization capability and robustness of the net. These problems are in general inherently associated with the use of pure supervised learning techniqu...
متن کاملA Comparative Evaluation of Deep Belief Nets in Semi-supervised Learning
In this report I studied the performance of deep belief nets (DBNs) on semisupervised learning problems, in which only a small proportion of data are labeled. First the performance between DBNs and support vector machines (SVMs) are compared to investigate the advantage of deep models over shallow ones. I also explored the use of DBNs as pre-training for SVMs and feed-forward nets (FFNs). The e...
متن کاملExtended Simple Colored Petri Nets
We present a new class of Petri Nets, called Extended Simple Colored Petri Nets (ESCP-nets), which are essentially Simple Colored Petri Nets (SCP-nets) with three added features: first, there is a built-in type of real numbers; second, tokens can be forced to wait in places; third, an interface specifies how the Net can be externally supervised. Even if these added features also add complexity,...
متن کاملNeural Networks with Java: neural net overview
What are neural nets? The biological model: The human brain The components of a neural net What they can and where they fail The learning process What means "learning" refering to neural nets? Supervised and unsupervised learning Forwardpropagation Backpropagation Selforganization
متن کاملSupervised models for an OpenAI driving game
Using the Open AI framework, we will use supervised learning with support of linear models and deep neural nets, in order to try to achieve honorable score at playing a driving game without human intervention.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1409.5185 شماره
صفحات -
تاریخ انتشار 2015